Mining Interesting Rules Without Support Requirement: A General Universal Existential Upward Closure Property

نویسندگان

  • Yannick Le Bras
  • Philippe Lenca
  • Stéphane Lallich
چکیده

Many studies have shown the limits of support/confidence framework used in Apriori-like algorithms to mine association rules. There are a lot of efficient implementations based on the antimonotony property of the support. But candidate set generation is still costly and many rules are uninteresting or redundant. In addition one can miss interesting rules like nuggets. We are thus facing a complexity issue and a quality issue. One solution is to get rid of frequent itemset mining and to focus as soon as possible on interesting rules. For that purpose algorithmic properties were first studied, especially for the confidence. They allow to find all confident rules without a preliminary support pruning. More recently, in the case of class association rules, the Universal Existential Upward Closure property of confidence has been exploited in an efficient manner. Indeed, it allows to use a pruning strategy for an Apriori-like but top down associative classification rules algorithm. We present a new formal framework which allows us to make the link between analytic and algorithmic properties of the measures. We then apply this framework to propose a General Universal Existential Upward Closure. We demonstrate a necessary and a sufficient condition of existence for this property. These results are then applied to 32 measures and we show that 13 of them do have the GUEUC property. · 1 Institut TELECOM; TELECOM Bretagne UMR CNRS 3192 Lab-STICC Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3 France · 2 Université de Lyon, Laboratoire ERIC, Lyon 2, France · 3 Université Européenne de Bretagne, France

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Algorithm for Mining Association Rules with Weighted Minimum Supports

Most existing algorithms employ a uniform minimum support for mining association rules. Nevertheless, each item in a publication database, even each set of items, is exhibited in an individual period. A reasonable minimum support threshold has to be adjusted according to the exhibition period of each k-itQmsQt Accordingly, this paper proposes a new algorithm, called WMS, for mining association ...

متن کامل

Fuzzy Weighted Association Rule Mining with Weighted Support and Confidence Framework

In this paper we extend the problem of mining weighted association rules. A classical model of boolean and fuzzy quantitative association rule mining is adopted to address the issue of invalidation of downward closure property (DCP) in weighted association rule mining where each item is assigned a weight according to its significance w.r.t some user defined criteria. Most works on DCP so far st...

متن کامل

Alternative Interest Measures for Mining Associations in Databases

Data mining is defined as the process of discovering significant and potentially useful patterns in large volumes of data. Discovering associations between items in a large database is one such data mining activity. In finding associations, support is used as an indicator as to whether an association is interesting. In this paper, we discuss three alternative interest measures for associations:...

متن کامل

Numeric Multi-Objective Rule Mining Using Simulated Annealing Algorithm

Abstract as a single objective one. Measures like support, confidence and other interestingness criteria which are used for evaluating a rule, can be thought of as different objectives of association rule mining problem. Support count is the number of records, which satisfies all the conditions that exist in the rule. This objective represents the accuracy of the rules extracted from the da...

متن کامل

A Regression-Based Approach for Improving the Association Rule Mining through Predicting the Number of Rules on General Datasets

Association rule mining is one of the useful techniques in data mining and knowledge discovery that extracts interesting relationships between items in datasets. Generally, the number of association rules in a particular dataset mainly depends on the measures of ’support’ and ’confidence’. To choose the number of useful rules, normally, the measures of ’support’ and ’confidence’ need to be trie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010